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Abstract 


Continuous exponential families are applied to linking forms via a single-group design. In this 
application, a distribution from the continuous bivariate exponential family is used that has 
selected moments that match those of the bivariate distribution of scores on the forms to be 
linked. The selected continuous bivariate distribution then yields continuous univariate marginal 
distributions for the two forms. These marginal distributions then provide distribution functions 
and quantile functions that may be employed in equating. Normal approximations are obtained 
for the sample distributions of the conversion functions. 
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Application of continuous exponential families to linking has been considered for 
equivalent-groups designs (Haberman, 2008). In such an application, it suffices to consider 
univariate continuous exponential families. For single-group designs for linking forms, bivariate 
continuous exponential families may be used. In this approach, the member of the bivariate 
continuous exponential family used to link two forms is chosen to share selected moments with 
the joint distribution of the scores on two forms to be linked. Once the bivariate continuous 
distribution is selected, one then has a continuous marginal distribution that corresponds to each 
of the forms under study. These marginal distributions may then be used to construct conversion 
functions that link the forms. This procedure is readily implemented. For random sampling, the 
estimated conversion functions are easily computed, and, for each point at which a conversion is 
desired, the estimated conversion function has an approximate normal distribution with mean 0 
and an asymptotic covariance matrix that is readily estimated. Results in this report are related 
to recent work on an alternative to kernel equating (Wang, 2008), although the models in this 
report are more general and numerical methods are somewhat different. 

Section 1 describes the proposed continuous exponential families for bivariate distributions 
and defines conversions based on these families are defined. Different models are compared by use 
of an information criterion (Haberman, 2008). Section 2 develops estimates of parameters and 
conversion functions and considers the large-sample properties of estimated conversions in terms 
of consistency and asymptotic normality. Estimated asymptotic standard deviations are provided 
for the estimated conversions. Section 3 presents a Newton-Raphson algorithm for computation of 
parameter estimates and discussed numerical quadrature issues. 

Section 4 illustrates results by use of a published example to which kernel equating has 
previously been applied (von Davier, Holland, & Thayer, 2004). 

Conclusions and possible further developments are examined in section 5. 

1 Bivariate Continuous Exponential Families 

In the equating problem considered, a sample of n examinees receives two different forms. 
For Form j, where j is 1 or 2, possible scores are in the closed interval with finite lower bound 
Cj and finite upper bound dj > Cj. For examinee i, 1 < i < n, the score on Form j is . It is 
assumed that the pairs (Xu , X^), 1 < i < n, are mutually independent and identically distributed 
random vectors such that X^j has distribution function Fj. No requirement is imposed that the 
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scores be integers or rational numbers. Nonetheless, in typical applications, Fj is not continuous, 
so that equating forms based on observed scores and some equipercentile approach normally 
involves some approximation of the distribution function Fj by a continuous distribution function 
Gj, which is strictly increasing on some open interval B(j ) that contains both Cj and dj. For each 
positive real p < 1, there exists a unique continuous and increasing quantile function Rj such 
that Gj(Rj(p)) = p. The linking function ei 2 for conversion of a score on Form 1 to a score on 
Form 2 is then ei 2 (x) = R 2 (G\[x )) for x in B{ 1), while the linking function e 2 i for conversion of 
a score on Form 2 to a score on Form 1 is e 2 i(x) = i?i(G , 2 (x)) for x in B( 2). Both ei 2 and e 2 i 
are strictly increasing and continuous on their respective ranges, and ei 2 and e 2 i are inverses, so 
that ei 2 (e 2 i(x)) = x for x in B( 2) and e 2 i(ei 2 (x)) = x for x in B\ (Haberman, 2008). If g\ is 
continuous at x in B{ 1) and 52 is continuous at e\ 2 (x), then application of standard results from 
calculus shows that ei 2 has derivative e' 12 {x) = gi(x)/g 2 (e\ 2 (x)) at x. Similarly, if g 2 is continuous 
at x in B{2) and g\ is continuous at e 2 i(x), then e 2 i has derivative e' 21 (x) = g 2 (x) / gi(e 2 i(x)) at x. 

One method to obtain a suitable pair of distribution functions G\ and G 2 is to approximate 
the joint distribution of X, = {Xu, Xj 2 ) by use of a bivariate continuous exponential family. Let 
B{j ) be bounded for 1 < j < 2. Let u^j be a polynomial of degree k on the interval B(j ) for k > 0 
and 1 < j <2. For a pair k = {k{ 1), k{ 2)) of nonnegative integers, let Uk be the polynomial on the 
plane such that Uk(x) = u^iji{xi)u^ 2 ) 2 { x 2 ) for real pairs x = (x\,X 2 )- Let be the expectation 
of rtk(Xj), so that /ik is a linear combination of the bivariate moments E{X^^X^ 2 ^) of X,; for 
integers j(l) < k{ 1) and j{ 2) < k{ 2)). Consider a nonempty set K of r pairs of nonnegative 
integers k = {k(l),k(2)) such that k{ 1) or k{ 2) is positive. Let g K be the JC-array of g^. k in 
K, and let u A -(x) be the /i-array of ttk(x), k in K. If yx is a real iv-array of y^, k in K, and 
z k is a real iv-array of k in K , then let y' K zx be the summation X^keft" Vk z k- Assume that, 
for any real iv-array yx-, the variance of y' K ux(X-i) is 0 only if = 0 for each k in K. Let 
B = B{ 1) x B{ 2) be the interval in the plane that consists of pairs (6(1), 6(2)) such that 6(1) is in 
-B(l) and 6(2) is in B{ 2). Then a unique continuous bivariate distribution with positive density on 
B has the exponential family density 

5A-(x) = 7 A'( 0 /v)exp[ 0 ' A -UA'(x)], 

x in B, for a unique A'-array Ox with elements $kAX k in K, and a unique positive real 7 jv(0a') 
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such that 


for k in K and 


Wk(x)ffA'(x)dx = // k 


/ £f A '(x)dx = 1 
J B 


(Gilula & Haberman, 2000). A random vector Y x = (YixtYzx) then exists such that Yjx is in 
B(j) for 1 < j < 2 and Y k has density gx- The moment equalities E(uk(Y x)) = X(u k (X,;)) hold 
for k in K, so that Y k - has a distribution close to that of X in the sense that the expected log 
penalty function lx = E(—loggx(X)) is the smallest expected log penalty function E(— log< 7 (X)) 
for all probability densities g on B such that 


fif(x) = 7A(0A*)exp[0' A ^UA'(x)] 


for some real X'-array 6x *, and E(— logg(X)) = lx only if Ox* = Ox- 

The moment equations expressed in terms of u k can be interpreted in terms of conventional 
moments if the set K satisfies the hierarchy rule that (k(l),k(2)) is in K whenever (h(l),h(2)) 
is in K, k{ 1) < h( 1), k( 2) < h( 2), k(l) and k( 2) are nonnegative integers, and k( 1) or k( 2) 
is positive. The equations £ , (« k (Yx)) = X(u k (X)) for k in K then hold if, and only if, 
E{Y^ ) Y^ ) ) = E{X^ (1) X% 2) ) for all k in K. 

For 1 < j <2, the distribution function Gjx of Yjx is strictly increasing and continuously 
differentiable on B(j). If B(j,y), y in B(j), consists of all pairs (j/ 1 , 2 / 2 ) such that y\ is in B( 1), 2/2 
is in B( 2), and yj < y, then 

Gjx(y) = f 5 (x)dx. 

J B(j,y) 

The inverse Rjx defined by Gjk{Rjx{p )) for 0 < p < 1 is also continuously differentiable and 
strictly increasing, so that the conversion functions ei 2 A = R 2 k{G\x) and e 2 iA = Rik(G- 2 x) are 
also continuously differentiable and strictly increasing. 

As in the case of univariate exponential families (Haberman, 2008), numerical work is 
simplified if computations employ the Legendre polynomials Pi- for k > 0 (Abramowitz & Stegun, 
1965, chaps. 8, 22). These polynomials are determined by the equations Pq{x) = 1, P\(x) = x, 
and 

Pk+i(x) = (k + l)' 1 ^ 2 ^ + 1 )xPk(x) - kP h - i(x)], 
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A; > 1. If inf'( B(j )) is the infimum of B(j ) and sup (B(j)) is the supremum of B(j), then it is 
relatively efficient for numerical work to let f3j = [inf(L>(j)) + sup(B(j))]/2 be the midpoint of 
B(j), rjj = [sup(B(j)) — inf(S(j))]/2 be half the range of B(j), and 


u k j(x) = P k ((x - Pj)/Vj)- 


In applications considered in this report, for integers r(j) > 1, 1 < j < 2, the set K 
consists of the r(l) + r(2) + 1 elements (fc(l), 0), 1 < k( 1) < r(l), (0, k( 2)), 1 < k( 2) < r( 2), and 
(1,1), so that the hierarchy principle holds, Yjk and X\j have the same r(j) initial moments for 
1 < j < 2 and Y\ k and Y- 2 k have the same correlation as X % \ and X t 2 - Thus Yjk and X t j have 
the same mean and variance for each j. If r(j) > 2, then Yjk and X k j have the same skewness 
coefficient. If r(j) > 3, then Yjk and Xij have the same kurtosis coefficient. In the case of 
r(l) = r(2) = 2 and use of Legendre polynomials, if 0^ is negative for k equal to (2,0) or (0,2) 
and 6^ ^ is less than 360^ 2O ^0^ O2 j, then Y x corresponds to a bivariate normal random variable 
Z = (Z\. Z 2 ). The distribution of Y x is the same as the conditional distribution of Z conditional 
on Z.j being in B(j) for 1 < j <2. Comparison of the density formula for the bivariate normal 
with the density formula for the corresponding continuous exponential family shows that the 
unconditional variance of Z\ satisfies 


^ 2 ( Z, 1 ) = - 

the unconditional variance of Z 2 is 

<?\Z 2 ) = - 


vf 


6^(2,0) - ^1,1)7(60(0,2)) 




60(0,2) - 0(1,1)/(60(2,0)) 
the unconditional correlation of Z\ and Z 2 is 


p(z 1 ,z 2 ) = d {hl) /[6(0 m e^ 2) ) 112 }, 


the unconditional mean of Z\ is 


E(Z 1 ) = (3\ 


Vl [0(1,0) _ 0(O,1)0(1,1)/ (60(0,2))] 

60(2,0) _ 0^1,1)7(60(0,2)) 


and the unconditional mean of Z 2 is 


E(Z 2 ) = p 2 


V2 [0(0,1) - 0(1,0)0(1,1)7(60(2,0))] 
60(0,2) - 0fi, 1)7(60(2,0)) 
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One alternative choice of I\ (Wang, 2008) has K contain all pairs (k(l),k(2)) of 
nonnegative integers such that k( 1) or k( 2) is positive, /c(l) < r(l), and k{ 2) < r(2). If ?’(1) > 1 
and r(2) > 2, then E(Y^ ) Y^ ) ) = E(X^ 1] X^ 2] ) for 1 < k{ 1) < r(l) and 1 < k{ 2) < r( 2). 


2 Estimation of Parameters 

The parameter 6k, the information criterion Ik, the distribution functions GjK, and the 
conversion functions e\ 2 K and e 2 ii< are readily estimated (Gilula & Habernran, 2000; Habernran, 
2008). For k in K , let be the sample mean n -1 Y17=i u k(X t ), and let m A ' be the JF-array with 
elements m^, k in K . If the covariance matrix of m A ' is positive definite, then 6k is estimated 
by the unique K- array dj< such that 


and 


/ u^(x)^(x)dx = m K , 
J B 


g K {x)dx = 1 , 


<M X ) = 7(^A')exp[0^ux(x)] 


for x in B. 

As the sample size n approaches oo, 6k converges to 6k with probability 1 , and 
v}G(6k — 6k) converges in distribution to a multivariate normal random variable with 0 mean 
and with covariance matrix A k = C^- 1 D kCJ^ (Gilula & Habernran, 2000). Here D k is the 
covariance matrix of u A -(X) and C k is the covariance matrix of the X-array u;<-(Y A -). Thus 

C K = / [ujf(x) - /i^]K(x) - ii K }'9K{*)dx. 

J B 

The estimate of C k is 


C K = / [u/t (x) - mx][uA'(x) - m K ]'gK{x)dx. 

J B 


The estimate of D k is 


Da' = (n - 1) 1 y^[uAr(Xj) - m A '][u A '(Xj) - m^]'. 

i =1 

Thus A k has estimate 

A K = C^DaC^ 1 . 
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For any nonzero constant A"-array z k, the estimated asymptotic standard deviation (EASD) of 
z' k 0 k is 

a{z K 6 K ) = n~ 1/2 ( z' K A K z K ) 1/2 , 

so that (z' K 0 K — z' K d K )/a( z' k Ok) converges in distribution to a standard normal random variable. 
The minimum expected penalty Ik may be estimated by 

I K = - log 7 k(0k) - 0 K m K - 

As the sample size n increases, Ik converges to Ik with probability 1 and in 1 / 2 (Ik — Ik) converges 
in distribution to a normal random variable with mean 0 and variance 

CT 2 (-log£fA'(X)) = /i k AkHk- 

The EASD of Ik is then 

0(1k) = n _1/2 ( m' K A K m K ) 1/2 . 

To verify this claim, let Hk be the function defined by 

n 

Hk( 0) = -n^^loggKiO), 

i =1 

and let Hko = Kk{&k)- Apply the central limit theorem to Hko to show that the scaled 
difference ti 1 ^ 2 {Hko ~ Ik) converges in distribution to a normal random variable with mean 0 and 
variance cr 2 (— log<?A;(X)). Differentiation and Taylor’s theorem show that the difference 

2(Hko ~ Ik) = (Ok ~ 0 K )’C* K (6 K - 0 K ) 

for C* K the covariance matrix of the A"-array u^(Y * K ) for some random vector Y * K with density 
1k(0*k) ex p[(^A') /u A'( x )] for some d* K on the line segment between 6k and Ok- It follows that 
v}/ 2 (Hkq — Ik) converges in probability to 0 and ra 1 / 2 ( Ik — Ik) converges in distribution as 
claimed. 

For 1 < j <2, the distribution function Gj k has estimate Gjk defined by 

Gji<(y) = / g K (*)dx 

for y in B(j), and the quantile function Rjk has estimate RjK defined by 

Gj K (RjK(p )) = P 
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for 0 < p < 1. For a continuously differentiable function fxjy on the set of K arrays, 

Gjkiy) = fi<jy{0 k), and f Kjy has gradient 

T jK(y) = / [ujf(x) - n K ]g K (x)dx 

->B(j,y) 

at Ok- The delta method (Rao, 1973, p. 388) and the fact that GjK is a continuous 
distribution function may be employed to demonstrate that, as the sample size n approaches 
oo, Gj K (y) converges to Gjxiy) with probability 1 for y in B(j), so that |GjK — GjK |, the 
supremum of \Gjx(y) — Gjx(y)\ for y in B(j), converges to 0 with probability 1. In addition, 
[Gkj(jj) ~ GjK(y)]/cr(GjK(y)) converges in distribution to a normal random variable with mean 0 
and variance 1 if 

<r(G jK (y)) = n - 1 / 2 {[T j K (y)]' A K T j K (y)} 1 / 2 . 

Similarly, Rjk(p) converges to RjK'{p ) with probability 1, and [Rjk{jp) — RjK{p)\/&(RjK{p)) 
converges in distribution to a normal random variable with mean 0 and variance 1 if 

a(R jK (p)) = [< gj K (RjK(p))]~ 1 cr(G j K(RjK(p ))) 

and gjK{y) is the marginal density corresponding to Gjk- Thus g\ k('!j) is the integral of 
gK((y,x 2 )) over x '2 in B( 2), and g 2 i<(y) is the integral of gK((xi,y)) over x\ in B( 1). Estimated 
asymptotic standard deviations may be derived by use of obvious substitutions of estimated 
parameters for actual parameters. Thus 

&(Gj K (y)) = n-^iiTMyflAKTjKiy)} 1 / 2 , 

where 

T jK(y)= / K(x)-m K ]g K (x)dx, 

a{R jK {p)) = [gji<{Rji< (p) )] - l v{G jK {Rji< (p))), 

and g jK {y) is the marginal density corresponding to Gjk- 

The estimate ei2A' of the conversion function e\2n{y) from Form 1 to Form 2 satisfies 
ei 2 i<{y) = B 2 k{Gik{]j)) for y in B( 1), and the corresponding estimate <=21 a of e 2 i k satisfies 
621 k{p) = 7 ?iA'(G < 2 A:(y)) for y in B{ 2 ) As the sample size n become large, e\2K{y) converges with 
probability 1 to ei 2 k{p) for y in B( 1), and &2\!<{'!/) converges with probability 1 to &2i k(jj) for 
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y in B( 2). In addition, [e\ 2 K-(y) — ei2A'(?/)]/o'(ei2A'(y)) converges in distribution to a standard 
normal random variable if 


< 7(612 {y)) = n 1//2 {[TiA-(y) - T 2 K{e 12 {y))YA K [T 1K (y) - T 2 A'(ei 2 (y))]} 1//2 , 

In like manner, [e 2 iA(y) — £ 2 \K{y)]/ &{& 2 iK{y)) converges in distribution to a standard normal 
random variable if 

<7(e 2i (y)) = n“ 1 / 2 {[T 2 A'(y) - TiA'(e 2 i(y))] / AA[T 2 A(y) - TiA(e 2 i(y))]} 1//2 , 

The EASD of ei 2 (y) is 

d(ei 2 (y)) = n^ 1 / 2 {[TiA-(y) - T 2 A(ei 2 (y))] / AA[TiA(y) - T 2 A(ei 2 (y))]} 1//2 , 

The EASD of e 2 i(y) is 

d(e 2 i (y)) = n~ 1//2 {[T 2 A'(y) - TiA(e 21 (y))] / AA[T 2 A(y) - Tix(e 2 i(y))]} 1//2 . 


3 Computational Issues 

Given a starting value 0ko, the Newton-Raphson algorithm may be employed to compute 
Ok- Computation of Ok corresponds to minimization of Hk- At step t > 0, a new approximation 
Okh+i) °f @f< is found by the equation 

&K(t+l) = ®I<t + C K \[m K - l-l’Kt] • 


Here 

SAt(x) = lK{@Kt) ex P[^A'i u A'( x )], 

x in B, jK(0Kt ) is defined so that 

/ 5At(x)dx = 1, 

Jb 

O'Kt. = / n K (x)g Kt (x)dx, 

Jb 

iua — liKt i s th e gradient of Hk at Oku and 


c Kt = / K(x) - p^][ua(x) - n Kt ]'g Kt (x)dxL 
J B 


is the Hessian matrix of Hk at OKt- 



The Newton-Raphson algorithm is also employed to evaluate quantile functions (Haberman, 
1996, pp. 426-427). The algorithm considers solution of the equation 

log [G jK (RjK(p))/p\ = 0 

for a given p in the interval (0,1). Given an initial approximation Rjko{p) to Rjk{p) and the fact 
that log {Gjic) has derivative gjK/GjK, the Newton-Raphson algorithm produces approximations 
RjKt{p) to Rjx{p) such that 

RjK(t+i)(p) = RjKt{p ) - G jK {R jK t{p))log[G jK {RjK{p))/p\/gjK{RjKt{p))- 

The Legendre polynomials used in typical cases to define also form the basis for the 
Gaussian quadrature used for evaluation of all integrals on B, B( 1), and B(2) (Abramowitz 
& Stegun, 1965, p. 887). In the case of Gjx{y), the limits of integration of the numerator 
related to Form j are from inf(R(j)) to y rather than from inf(R(j)) to sup (B(j)), so that the 
scaled Legendre polynomials are modified accordingly for Gaussian quadrature. In this paper, 
calculations use 20-point Gaussian quadrature for each dimension. The larger number of points 
than used in previous work on equivalent groups (Haberman, 2008) reflects the numerical challenge 
associated with form scores that are typically highly correlated. 

The rectangle rule of integration has been employed in the literature for the case with 
B(j) = (Cj — 0.5, dj + 0.5) (Wang, 2008), with cj and dj integers, and with the integers from Cj to 
dj used as equally weighted quadrature points. Because the distance between quadrature points 
is not small, this integration approach yields errors that are not negligible. For expectations 
approximation errors are of order (d\ — ci)~ 2 + (c ?2 — C 2 ) -2 , so that they can 
be expected to be relatively minor for very long tests and somewhat more obvious for quite 
short tests. Unfortunately, the proper conditions for use of Sheppard’s corrections (Kolassa & 
McCullagh, 1990) do not apply, so that the most precise approach to evaluation of errors is 
unavailable. It should be emphasized that, in practice, use of Legendre polynomials results in 
both higher accuracy and less computation than does use of the rectangle rule. 

4 Example 

Table 8.2 of von Davier et al. (2004) provides an example of a single-group design with 
Cj = 0 and dj = 20 for 1 < j < 2. To illustrate results, let B( 1) = B( 2) = (—0.5,20.5). The Ukj 
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are defined based on Legendre polynomials, and three sets K are considered. These sets are 


^( 2 ) = {( 1 , 0 ), ( 2 , 0 ), ( 0 , 1 ), ( 0 , 2 ), ( 1 , 1 )}, 

K( 3) = {(1, 0), (2, 0), (3,0), (0,1), (0, 2), (0, 3), (1,1)}, 


and 


K( 4) = {(1, 0), (2, 0), (3,0), (4,0), (0,1), (0, 2), (0, 3), (0,4), (1,1)}. 

Results for parameters are summarized in Table 1. Results in terms of estimated expected log 
penalties are summarized in Table 2. These results suggest that gains beyond the quadratic case 
K( 2) are quite small, although K( 4) differs from K( 3) more than K{ 3) differs from K{ 2). 


Table 1 

Model Parameters 


k 

K( 2) 

Est. EASD 

K( 3) 

Est. EASD 

K( 4) 

Est. EASD 

(1,0) 

0.902 

0.118 

1.012 

0.140 

1.056 

0.153 

(2,0) 

-6.174 

0.226 

-6.226 

0.231 

-6.381 

0.262 

(3,0) 



0.174 

0.124 

0.234 

0.147 

(4,0) 





-0.192 

0.152 

(0,1) 

-0.463 

0.126 

-0.543 

0.138 

-0.477 

0.179 

(0,2) 

-7.152 

0.266 

-7.141 

0.265 

-7.931 

0.307 

(0,3) 



-0.117 

0.117 

-0.034 

0.167 

(0,4) 





-0.836 

0.164 

(1,1) 

15.619 

0.663 

15.624 

0.664 

15.520 

0.661 


Note. EASD = estimated asymptotic standard deviation, est. = estimate. 


Table 2 

Estimated Expected Log Penalties 


K 

Estimate 

EASD 

K( 2) 

4.969 

0.022 

K( 3) 

4.968 

0.022 

K( 4) 

4.960 

0.022 


Note. EASD = estimated asymptotic standard deviation, est. = estimate. 


Not surprisingly, the three choices of K lead to rather similar conversion functions. 
Consider Table 3 for the case of conversion of Form 1 to Form 2. A bit more variability in results 
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exists for very high or very low values, although estimated asymptotic standard deviations are 
more variable than are estimated conversions. Note that results are also similar to those for kernel 
equating (von Davier et al., 2004, chap. 8) shown in Table 4. These results employ a log-linear 
model for the joint distribution of the scores that is comparable to the model defined by A"(3) for 
a continuous exponential family. The log-linear fit preserves the initial three marginal moments 
for each score distribution as well as the covariance of the two scores. As a consequence, the 
marginal distributions produced by the kernel method have the same means and variances as do 
the corresponding distributions of Xu and Xj 2 , but the kernel methods yields the distribution of 
a continuous random variable for the first form with a skewness coefficient that is 0.987 times the 
original skewness coefficient for Xu and a distribution of a continuous random variable for the 
second form that with a skewness coefficient that is 0.983 times the original skewness coefficient 
for X i2 . 


5 Conclusions 

As in the case of equivalent groups (Haberman, 2008), linking forms in a single-group 
design by continuous exponential families appears similar in result to linking the same forms via 
kernel equating. Continuous exponential families offer some possible gains. Unlike kernel equating, 
bandwidths are not required, so that fewer specifications are required. In kernel equating, 
log-linear smoothing and production of continuous distribution functions require distinct steps. In 
the case of continuous exponential families, a model is fit that immediately results in continuous 
distribution functions. 

Although this gain is not apparent in the example, a possible gain from continuous 
exponential families is that application to assessments with unevenly spaced scores or very large 
numbers of possible scores is completely straightforward. Thus direct conversion from a raw score 
on one form to an unrounded scale score on a second form involves no difficulties. In addition, in 
tests with formula scoring, no need exists to round raw scores to integers during equating. 

The single-group design provides the basis for more complex linking designs with anchor 
tests (von Davier et al., 2004, chap. 9), so that results of this report are readily applied to a very 
wide variety of linking problems. 


11 



Table 3 

Comparison of Conversions From Form 1 to Form 2 


Value 

K( 2) 

Est. EASD 

K( 3) 

Est. EASD 

K( 4) 

Est. EASD 

0 

0.111 

0.077 

-0.040 

0.113 

0.404 

0.262 

1 

1.168 

0.128 

0.927 

0.204 

1.404 

0.264 

2 

2.144 

0.135 

1.917 

0.208 

2.269 

0.221 

3 

3.091 

0.130 

2.910 

0.182 

3.121 

0.176 

4 

4.028 

0.120 

3.899 

0.151 

3.987 

0.140 

5 

4.959 

0.108 

4.881 

0.122 

4.874 

0.117 

6 

5.889 

0.097 

5.854 

0.101 

5.785 

0.105 

7 

6.819 

0.086 

6.819 

0.087 

6.721 

0.100 

8 

7.748 

0.076 

7.775 

0.080 

7.679 

0.095 

9 

8.677 

0.069 

8.722 

0.078 

8.653 

0.090 

10 

9.606 

0.065 

9.661 

0.078 

9.634 

0.086 

11 

10.536 

0.064 

10.591 

0.077 

10.611 

0.085 

12 

11.465 

0.066 

11.512 

0.077 

11.574 

0.088 

13 

12.394 

0.073 

12.425 

0.077 

12.514 

0.092 

14 

13.324 

0.081 

13.331 

0.081 

13.427 

0.094 

15 

14.256 

0.091 

14.231 

0.090 

14.310 

0.096 

16 

15.193 

0.102 

15.128 

0.105 

15.166 

0.101 

17 

16.141 

0.113 

16.033 

0.126 

16.003 

0.116 

18 

17.119 

0.121 

16.967 

0.149 

16.838 

0.142 

19 

18.173 

0.123 

17.985 

0.167 

17.716 

0.174 

20 

19.495 

0.094 

19.304 

0.150 

18.842 

0.198 


Note. EASD = estimated asymptotic standard deviation, est. = estimate. 
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Table 4 

Conversions From Form 1 to Form 2 by Kernel Equating 


Value 

Estimate 

EASD 

0 

-0.002 

0.162 

1 

0.999 

0.221 

2 

1.981 

0.221 

3 

2.956 

0.193 

4 

3.926 

0.159 

5 

4.890 

0.128 

6 

5.850 

0.104 

7 

6.805 

0.089 

8 

7.756 

0.080 

9 

8.702 

0.078 

10 

9.643 

0.077 

11 

10.580 

0.077 

12 

11.512 

0.077 

13 

12.439 

0.078 

14 

13.362 

0.083 

15 

14.283 

0.095 

16 

15.206 

0.115 

17 

16.140 

0.140 

18 

17.105 

0.167 

19 

18.155 

0.185 

20 

19.411 

0.158 


Note. EASD = estimated asymptotic standard deviation, est. = estimate. 


13 



References 


Abramowitz, M., & Stegun, I. A. (1965). Handbook of mathematical functions. New York: Dover. 
Gilula, Z., & Haberman, S. J. (2000). Density approximation by summary statistics: An 
information-theoretic approach. Scandinavian Journal of Statistics, 27, 521-534. 

Haberman, S. J. (1996). Advanced statistics: Vol. 1. Description of populations. New York: 
Springer-Verlag. 

Haberman, S. J. (2008). Continuous exponential families: An equating tool (Research Rep. No. 
RR-08-05). Princeton, NJ: ETS. 

Kolassa, J. E., & McCullagh, P. (1990). Edgeworth series for lattice distributions. The Annals of 
Statistics, 18, 981-985. 

Rao, C. R. (1973). Linear statistical inference and its applications (2nd ed.). New York: John 
Wiley. 

von Davier, A. A., Holland, P. W., & Thayer, D. T. (2004). The kernel method of test equating. 
New York: Springer. 

Wang, T. (2008). The continuized log-linear method: An alternative to the kernel method of 
continuization in test equation. Applied Psychological Measurement, 55(7), 527-542. 


14 



